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NORMALITY IN PISOT NUMERATION SYSTEMS 


ADRIAN-MARIA SCHEERER 


Abstract. Copeland and Erdos [6] showed that the concatenation of primes when writ¬ 
ten in base 10 yields a real number that is normal to base 10. We generalize this result 
to Pisot number bases in which all integers have finite expansion. 


1. Introduction 

Let X be a real number and 6 ^ 2 a positive integer. Then x has a 6-adic representation 
of the form 

OO 

X = [x\ + ^ eib~^ 

i=l 

where Si G {0,1,.. . & — 1} are the digits of x and [xj denotes the integer part. We call 
X normal to base b, if any block d = did 2 ... dk oi k ^ 1 digits occurs with the expected 
frequency in the 6-adic representation of x. This means that 

lim -Nd{x,n) = 
n—>-oo n m 

where Nd{x,n) counts the occurrences of the block d within the hrst n digits of x. A real 
number x is called absolutely normal if it is normal to every base b ^ 2. 

The terminology of a normal number can be extended to the context when the underlying 
base is no longer an integer. Renyi |22] introduced and Parry [21] studied numeration 
systems with respect to real bases /3 > 1. Each real number x has a representation of the 
form 

— OO 

X = 

i=L 

with digits G {0,1,..., f/d] — 1}. One way to produce the digits is the so-called greedy 
algorithm using the transformation : x i—)■ /9x (mod 1) on the unit-interval. In a natural 
way, corresponds to the shift-operator on the set IT°° of right-inhnite sequences over 
{0,1,..., f/S] — 1} and each x corresponds to its sequence of digits. A sequence oj in 1U°° is 
called fi-normal for a given shift-invariant measure p on if all possible hnite patterns 
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of digits occur in u with asymptotic frequency given by fi. Consequently, the real x is called 
fi-normal if its sequence of digits is /i-normal. Details will be made clear in the next section. 

From a modern approach, using Birkhoff’s point-wise ergodic theorem, it is immediate 
that almost all numbers are normal to a hxed 0 base b. The map Tb : x ^ bx (mod 1) 
on the unit-interval is the underlying ergodic transformation which preserves Lebesgue- 
measure. Knowing this, the existence of normal numbers to base fe is in a certain sense not 
very surprising. However, in this context, there are two observations that make the study 
of normal numbers interesting. 

First: The explicit construction of normal numbers. The study of normal numbers dates 
back to Borel [1], who in 1909 showed that Lebesgue-almost all numbers are absolutely 
normal. However, the hrst explicit example of a normal number is due to Champernowne [5] 
in 1933. He showed that the concatenation of integers, when written in base 10, 

0,1 2 3 4 5 6 7 8 9 10 11 12 13..., 
is normal to base 10. Copeland and Erdos [6] showed that 

0,2 3 5 7 11 13 17 19..., 

i.e. the concatenation of primes in base 10 is normal to base 10. Besicovitch [2] showed 
that the decimal formed by concatenation of the squares in base 10 is normal to base 
10. This construction was extended to general integer-valued polynomials by Davenport 
and Erdos [7] and by Schiffer [23] and Nakai and Shiokawa [T8] . to more general 
polynomial settings. Nakai and Shiokawa 123 also evaluated polynomials at primes, and 
Madritsch [15] showed that numbers generated by pseudo-polynomial sequences along the 
primes are normal. Further constructions of normal numbers in the spirit of Copeland and 
Erdos and Erdos and Davenport include [H] and im. 

These constructions, most notably the one due to Champernowne, have subsequently 
been generalized to other number systems. To mention are the works by Ito and Sh¬ 
iokawa ra who generalized the Champernowne-construction to real bases /3 > 1, and by 
Madritsch and Mance [16] who modihed the construction to produce normal sequences in 
general symbolic dynamical systems. Bertrand-Mathis and Volkmann [1] give a generalized 
Copeland-Erdos construction to symbolic dynamical systems. 

In this paper we prove a polynomial Copeland-Erdds-construction to bases which are 
not integers. We use results from the work of Bertrand-Mathis and Volkmann [1] which in 

^Borel’s result also follows, but for the moment we want to restrict the discussion to one single base. 
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turn is extending the original work of Copeland and Erdos |n|. 


Second: Not all numbers are normal (almost all cannot be improved to all, hence Tj, is 
not uniquely ergodic), so the ergodic theorem is strict. This can be seen in context of the 
following ‘test’ for ergodicity. 


Theorem 1.1 (See Theorem 1.4 in [3]). Let J^q be a fieloQ generating T. If T 
every A in IFq, then T is ergodic. 



Hence normal numbers, or rather the existence of non-normal numbers, can be used to 


test the underlying transformation for unique ergodicity. Although the Lebesgue-measure 
is not the only Tfe-invariant probability measure on the unit-interval, it is the only one 
that maximizes entropy (see the following section). Accepting this for the moment as a 
dehnition of uniqueness, it is possible to study normality in greater generality with respect 
to a given transformation that can be different from T^. 

2. Preliminaries 

In the following we present a condensed introduction to /^-expansions, Pisot numbers 
and symbolic dynamical systems - the context in which we want to state our result. 

Let /3 > 1 be a hxed real number. A fl-expansion of a non-negative real number x is a 
representation of x as a sum of integer powers of ft of the form 


— OO 



( 2 . 1 ) 


i=L 


where the digits £* G {0,1,... f/S] — 1} are obtained by the following greedy algorithm. Let 
L G Z such that ft^ ^ x < and put El = [x/ft^\ and = {x/ft^}. For L ^ i > —oo, 
dehne recursively Ei = [/3rj+iJ and rj = {/^rj+i}. /^-expansions have been introduced and 
studied by Renyi [22] and Parry [21]. 

Let Tg be the ft-transformation Tg : [0,1) —)■ [0,1), x i-A {ftx}. The digits in the ft- 
expansion 12.11 are given by Ei = \_ftT^~^ (x)\. Renyi [22] showed that there is a unique 
normalized measure fig on [0,1) that is invariant under Tg and equivalent to the Lebesgue 
measure. This measure also maximizes the entropy of the corresponding symbolic dynam¬ 
ical system and we use it to dehne normal numbers in base /3, see below. 


^The underlying sigma-field J- is in our case the Borel sigma-algebra which is generated by 6-adic 


intervals. 

^Almost every orbit under T visits A with the expected asymptotic frequency. 
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A Pisot number /3 is a real algebraic integer (3 > 1 such that all its conjugates have 
absolute value less than 1. For a Pisot number of degree d, we denote its conjugates by 
i = 2,..., d, and the corresponding conjugations by cxj. 

We work with Pisot numbers such that every positive integer has hnite /3-expansion. A 
criterion for when this is the case can for example be found in [9]. 

In our notation concerning symbolic dynamical systems we follow Bertrand-Mathis and 
Volkmann [1]. 

Let A be a hnite alphabet. A* is the set of all hnite (possibly empty) words over A and 
is the set of all (right-)inhnite words over A. We call a subset L of A* a language. L* 
denotes the set of all hnite concatenations of words from L. Let W{L*) be the set of all 
non-empty factors of words in L*. For a hnite word ca we let ||a;|| be its length. A language 
L is said to be connecting of order j ^ 0 if for any two words a, & G hF {L*) there is a word 
u = M(a, h) e W{L*) of length j such that auh G W{L*). For each a, 6 G W{L*) we choose 
one u = M(a, h) and introduce the notation a © 6 := aub. In the applications we have in 
mind, this intermediary word will simply consist of O’s. We write W{L*) = Ln where 
Ln is the subset of words in W{L*) of length n. Also denote by the subset of W{L*) of 
words of length less or equal to n. For a language L we denote by W°° = W°°{L) the set of 
all inhnite words generated by L, i.e. the set of all to = 0102 ... such that ajOj+i .. .ak E L 
for all 1 ^ i < /c < cxo. 

We introduce the discrete topology on the alphabet A and the corresponding product 
topology on the set of sequences A^. With each language L we associate the symbolic 
dynamical system 

where W°° = B is the a-algebra generated by all cylinder sets of A^, i.e. sets of 

the form 

c{uj) = {aia 2 ... G A^ | 0102 ... a„ = cn} 

for some word ca G A* of length n. T is the shift operator and I is the set of all T-invariant 
probability measures n on B. We will write niuj) instead of /i(c(ci;)) for a hnite word uj. 

With each symbolic dynamical system Sl 'we associate the entropy 

h{W°°) = sup h(/i), 

MS/ 

where h{fi) denotes the entropy of the measure jj, (cf. Chapter 2 of Billingsley [3]). For 
hnite alphabets it is known that there always exists a unique measure fi that maximizes 
the entropy, i.e. such that h{W°°) = h{fi) (e.g. Proposition 19.13 in [8]). In the context of 
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the symbolic dynamical system generated by a /^-expansion the measnre np is precisely the 
measnre with maximnm entropy eqnal to log (3 (cf. [12] and the work by Bertrand-Mathis 
and Volkmann m)- In the following, we will work with this maximal measnre. 

For an inhnite word to = toit 02 ... G W°° and a block d = did 2 ... dfc G L we denote with 
Nd{oj,n) the nnmber of occnrrences of d within the hrst n letters of uo. If the word uo is 
hnite, we denote by Ndioo) the occnrrences of d in it. An inhnite word to G W°° is called 
^-normal or fi-normal sequence if for all d G W{L*) 

lim —Nd{to,n) = fi{d). 

n—^oo Tl 

We note that a non-negative real nnmber x is fip-normal if and only if the seqnence of 
digits in its /^-expansion is a qip-noimal seqnence. 

In this terminology, the main resnlt of Bertrand-Mathis and Volkmann [T] is the following 

Theorem 2.1. Let L be a connecting language and 01 , 02 ,... a sequence of different ele¬ 
ments ofW{L*), ||oi|| ^ ||oi|| ^ ..., satisfying the generalised Copeland-Erdos condition: 

Ve > 0 Bno{e) Vn ^ uq jj{oy | ||oiy|| ^ n} > 

Then the infinite word o = 01 O 2 ... G W°° is normal. 

The symbolic dynamical system generated by the (3-shift (or corresponding to the (3- 
expansion) arises natnrally when viewing the /^-expansions of real nnmbers x G [0,1) 
as inhnite words over the alphabet {0,1,... [/I]}. W°° is the set of these right-inhnite 
seqnences, B the a-algebra generated by all cylinder sets, T the shift operator (it corre¬ 
sponds to the /^-transformation Tp), and I the set of all T-invariant probability measnres 
on B. We work with the nniqne entropy-maximizing measnre pi in I. It corresponds to pip 
on [0,1) in the sense that for a hnite word to the measnre of the cylinder set p.{to) is the 
same as the measnre pip{Co) of the set Co of all real nnmbers in [0,1) whose /3-expansion 
starts with uo. We allow ns to speak of these two concepts interchangeably. 

For a given Pisot nnmber /3, denote by {n)p the word over {0,1,... [/3] — 1} that cor¬ 
responds to the /3-expansion of the positive integer n. We prove the following polynomial 
generalization of [6|. 

Theorem 2.2. Let (3 be a Pisot number such that all integers have finite fi-expansion 
and let the measure ftp be as before. Let f be a polynomial of degree g that maps positive 
integers to positive integers. Then 

(/(2))/3 © (/(3))/3 © {f{5))p © {f{7))p © {f{ll))p © ... 
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is a fip-normal sequence. 

In the context of the dynamical system generated by the /3-shift, the entropy is log/3. 
Hence by Lemma 2 of [1], we have bonnds on the number of words of length n in W{L*). 
For n sufficiently large we have /3” |L„| /3"', where the implied constants do not 

depend on n. Therefore 

n 

(2,2) |L'J = li-l « 

12=1 

for all large n. 


3. General Case 

First we need upper and lower bounds of the length of the /3-expansion of integers. Under 
the assumption of /3 being a Pisot number such that all integers have hnite /3-expansions 
we can in fact show that the lengths of these expansions are asymptotically of logarithmic 
order of magnitude. 

Note that if n = integer part and Yl,7=-'i 

fractional part. In the following we will think of n as hxed and omit writing the dependency 
on it in the lengths L and R. 

Lemma 3.1. Let (3 he a Pisot number of degree d such that all natural numbers have finite 
(3-expansion. For the length R{n) of the fractional part of n upper and lower bounds of the 
following form hold, for sufficiently large n: 

6 log n ^ R{n) ^ 6 ' log n 

where 5 is a positive constant (specified in the proof) and the difference 6 ' — S > 0 can be 
chosen to be arbitrarily small. 

Proof. We have -pfri ^ [O;!)- Following an argument of Proposition 3.5 Frougny and 
Steiner [10], for a certain number k the number Tp{-^f+i) is an element of the hnite set 

K = {y£Z|/3]n|0.1) I |g,(y)| < 1+ for 2<jSd}. 

To see this, let the /3-expansion of n for the moment be 8182 ■ ■ ■ and put 2 ; := Then 

for all /c ^ 0, 

k 

T^{z) = /fry'(z) - er = ... = /fL - 

1=1 
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Hence for all fc ^ 0 and 2 ^ j ^ d, 

k 

Let k be eqnal to 


max 


= L -|- 1 -|- max 

logn 


^og\(Tj{fi)-^ 




which we write as 

k = L + 1 + (51ogn + 0(1), where 6 := max -—;— , . 

Note that the 0(1) constant coming from the ceiling lies in [0,1). For this choice of /c, 

T^(^) € y. 

Let W be the maximum length of the /^-expansions of the elements in Y. We therefore 
obtain asymptotic bounds of R, 

S \ogn ^ R ^ W + 1 + S \ogn ^ 6 ' \ogn, 

where 6 ' > 6 can be chosen arbitrarily close. □ 

In the course of the proof of Theorem 12.21 we deal with a problem caused by a constant 
coming from the length of the words we want to patch together. This issue can be cir¬ 
cumvented by dividing the words into smaller subwords and glueing them back together 
afterwards. The following lemma ensures normality of the resulting word when patched 
back together. 

Lemma 3.2. Let v = ViV 2 Vs ... and w = W 1 W 2 W 3 ... be fi-normal words such that ||nj|| = 
Ikill -^00 and assume that the quantities m— j, —it o,nd n— n, ^, n —r, tend to zero 
as N tends to infinity. Then the word 

U = ViWiV 2 W 2 V^W^ ... 

is fi-normal. 

Proof. We work with a hxed hnite string d = di... dk of length k. Since v is p-normal, 
we have 

N 

Nd{v,n) = '^Nfivi) + 0{N) + 0(||nAr+i||) —^ fi{d)n 

i=l 

as n —)■ 00 , where N is chosen such that ||ni|| ||nv|| ^ n < ||ni|| -f- ... -f ||njv+i||. 

The 0{N) contribution comes from possible occurrences in-between two words Vi and Uj+i. 
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Hence by the assumptions 

N 

'^Nd{vi) —^ n{d)n 
i=l 

as n —?■ oo. Then, by arguing similarly, 

N N 

Nd{u,n) = '^Ndivi) + '^Nd{wi) + 0{N) + 0(||M;Ar+i||) = + /i(c?)^ + o(n). 

i=l i=\ 

Here, N is chosen such that ||ni|| + ||tci|| + ... + ||n 7 v|| + W^nW ^ n < ||ni|| + ||tci|| + ... + 
||nAr+i|| + lltCAr+ill. This shows the normality of u. □ 

To verify the conditions of Lemma 13.21 in our application, we need some basic number 
theoretic input. 

Lemma 3.3. Denote by the N-th prime number. We have for N ^ oo 

( ^-^- N - 

logpi + . . . + logpAT logpi + . . . + logpAT 

Proof. This is a consequence of the prime number theorem. We have 

Pn 

log Pi = 9{pn) r^pN ^ N log N. 

i=l 

□ 

With these preliminaries we can prove our theorem. 

Proof of Theorem 12.21 The polynomial f{n) = + ... + ain + oq behaves asymptot¬ 

ically like . Hence for any e > 0 and any n large enough 

f{n) ^ (1 -I- e)agn^ and f{n) ^ (1 — e)agn^. 

Thus a consequence of Lemma 13.11 we have the upper bounds 

-R(/W) ^ -R((l + £)agn^) 

^ h'(log(l e) -h log ag + g log n) 

^ C' log n 

for some constant C > 6'g arbitrarily close and n large enough. Similarly we obtain lower 
bounds of the form 

R{f{n)) ^ Clogn, 

where C < 6g is a. positive constant and can be chosen arbitrarily close if n is assumed to 
be large enough. 
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A direct consequence is an asymptotic upper bound for the total length of f{n) when 
written in base f3\ 


\\{f{n))p\\=L{f{n)) + l + R{fin)) 

^ L((l + e)agn^) + 1 + /2((1 + e)agn^) 

g\ogn + log ttg 


< 


^ C 


log/3 

logn 

log/3’ 


+ 0(1) + C logn 


for some (other) constant C. Note that C only depends of / and /3. 

However, applying Theorem 12.11 directly does not work as we do not have control of 
the size of the constant C. This can be avoided by choosing an integer m ^ 0 such that 
CjTR ^ 1 and dividing the words (/(n))^ in 2™ words of (almost) equal length. Then we 
can apply Theorem 12.11 to show normality of the concatenations of those shorter words. 
They can subsequently be patched back together applying Lemmamultiple times. Note 
that the lower bounds for R enable us to use Lemma [3.21 

For a prime number p we have ||(/(p))^|| ^ so ||(/(p))/ 3 |l ^ iV is implied by 

^ equivalent to 

p ^ /3^/^. 

Thus, counting primes below (3^^^, 


7r(/3^/^) 


pNIC 

N/C log/3 


log/3 


for any e < 1 arbitrarily close and N large enough. Here we see why we require C ^ 1, 
namely so that the condition of Theorem 12.11 


7r(/3^/^) ^ (/3^)i-" 


for any e' > 0, is implied by (3^^^ being eventually greater than Inserting 

the intermediary word to obtain admissibility in base /3 does not destroy the normality of 
the sequence since we are inserting a word of constant length. □ 


4. Special Case: Golden Ratio Base 

Let p = be the golden ratio, i.e. the dominating root of the polynomial — x — 1. 
All positive integers have hnite <p-expansion (see for example Theorem 2 of 0 )- Considering 
this special case is interesting insofar that we can provide an exact formula on the length 
of the fractional part. Let n be a positive integer and denote by L + 1 and R the lengths 
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of its integer and fractional part when written in base tp. From the greedy algorithm we 
already know that 


(p ^ n < if 


log<^" 


The following description of the fractional part is borrowed from m- 


Lemma 4.1. We have 


R 


L , L even 
L \ ^ L odd. 


Proof. Recall that is the conjugate of ip. Let a 2 be the corresponding conjugation. 
Since it hxes the integers, we have 

L ^ R 

n = (T 2 {n) = ^ Ski - f = ^ {-lfe_k^^- 

k=-R ^ k=-L 

The last sum can only be positive when R is even. In this case we have n < ip^ + ip^~‘^ + 
^R-4 _ _ _ _ and since = 0 also n > (p^ — ip^~^ — ip^~^ — ... = ip^~^. This 

together with ip^ ^ n < tp^^^ shows that R is the even integer satisfying L ^ R ^ L + 1. 

□ 


The normality to base ip of the word 

(2)^0(3)^0(5)^0... 

is then shown by hrst applying the generalized Copeland-Erdos criterion Theorem 12.11 to 
the concatenations of only fractional or integer part and subsequently patching them back 
together using Lemma [3.21 once. 
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